Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 46713 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 35 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 6.1 MiB |
| Average record size in memory | 136.0 B |
Variable types
| Categorical | 4 |
|---|---|
| Text | 1 |
| Numeric | 11 |
| DateTime | 1 |
| Dataset has 35 (0.1%) duplicate rows | Duplicates |
Bdrms is highly overall correlated with Fbath and 2 other fields | High correlation |
District is highly overall correlated with Nbhd | High correlation |
Fbath is highly overall correlated with Bdrms and 2 other fields | High correlation |
Fin_sqft is highly overall correlated with Bdrms and 4 other fields | High correlation |
Lotsize is highly overall correlated with Year_Built | High correlation |
Nbhd is highly overall correlated with District and 2 other fields | High correlation |
Nr_of_rms is highly overall correlated with Sale_price | High correlation |
PropType is highly overall correlated with Nbhd and 1 other fields | High correlation |
Sale_price is highly overall correlated with Nr_of_rms | High correlation |
Stories is highly overall correlated with Fin_sqft and 2 other fields | High correlation |
Style is highly overall correlated with Fin_sqft and 4 other fields | High correlation |
Units is highly overall correlated with Bdrms and 4 other fields | High correlation |
Year_Built is highly overall correlated with Lotsize | High correlation |
PropType is highly imbalanced (97.8%) | Imbalance |
Hbath is highly imbalanced (54.9%) | Imbalance |
Fin_sqft is highly skewed (γ1 = 27.23746815) | Skewed |
Units is highly skewed (γ1 = 160.0138676) | Skewed |
Bdrms is highly skewed (γ1 = 209.8119759) | Skewed |
Nr_of_rms has 24929 (53.4%) zeros | Zeros |
Reproduction
| Analysis started | 2024-08-04 08:05:56.613017 |
|---|---|
| Analysis finished | 2024-08-04 08:06:44.918019 |
| Duration | 48.31 seconds |
| Software version | ydata-profiling vv4.9.0 |
| Download configuration | config.json |
PropType
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 365.1 KiB |
| Residential | |
|---|---|
| Commercial | 215 |
| Exempt | 4 |
| Condominium | 1 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.994969 |
| Min length | 6 |
Characters and Unicode
| Total characters | 513608 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Residential |
|---|---|
| 2nd row | Residential |
| 3rd row | Residential |
| 4th row | Residential |
| 5th row | Residential |
Common Values
| Value | Count | Frequency (%) |
| Residential | 46493 | |
| Commercial | 215 | 0.5% |
| Exempt | 4 | < 0.1% |
| Condominium | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| residential | 46493 | |
| commercial | 215 | 0.5% |
| exempt | 4 | < 0.1% |
| condominium | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 93205 | |
| i | 93203 | |
| l | 46708 | |
| a | 46708 | |
| t | 46497 | |
| n | 46495 | |
| d | 46494 | |
| R | 46493 | |
| s | 46493 | |
| m | 436 | 0.1% |
| Other values (8) | 876 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 466895 | |
| Uppercase Letter | 46713 | 9.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 93205 | |
| i | 93203 | |
| l | 46708 | |
| a | 46708 | |
| t | 46497 | |
| n | 46495 | |
| d | 46494 | |
| s | 46493 | |
| m | 436 | 0.1% |
| o | 217 | < 0.1% |
| Other values (5) | 439 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 46493 | |
| C | 216 | 0.5% |
| E | 4 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 513608 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 93205 | |
| i | 93203 | |
| l | 46708 | |
| a | 46708 | |
| t | 46497 | |
| n | 46495 | |
| d | 46494 | |
| R | 46493 | |
| s | 46493 | |
| m | 436 | 0.1% |
| Other values (8) | 876 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 513608 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 93205 | |
| i | 93203 | |
| l | 46708 | |
| a | 46708 | |
| t | 46497 | |
| n | 46495 | |
| d | 46494 | |
| R | 46493 | |
| s | 46493 | |
| m | 436 | 0.1% |
| Other values (8) | 876 | 0.2% |
Address
Text
| Distinct | 38081 |
|---|---|
| Distinct (%) | 81.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 365.1 KiB |
Length
| Max length | 35 |
|---|---|
| Median length | 32 |
| Mean length | 15.57727 |
| Min length | 12 |
Characters and Unicode
| Total characters | 727661 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 30673 ? |
|---|---|
| Unique (%) | 65.7% |
Sample
| 1st row | 3033 N 35TH ST |
|---|---|
| 2nd row | 1908 E WEBSTER PL |
| 3rd row | 812 N 25TH ST |
| 4th row | 959 N 34TH ST |
| 5th row | 3209 W WELLS ST |
| Value | Count | Frequency (%) |
| st | 30817 | 16.4% |
| n | 20541 | 10.9% |
| s | 13247 | 7.0% |
| w | 10853 | 5.8% |
| av | 10730 | 5.7% |
| e | 2093 | 1.1% |
| pl | 1739 | 0.9% |
| dr | 978 | 0.5% |
| bl | 778 | 0.4% |
| ct | 673 | 0.4% |
| Other values (10704) | 95912 |
Most occurring characters
| Value | Count | Frequency (%) |
| 141658 | ||
| T | 61198 | 8.4% |
| S | 51646 | 7.1% |
| 3 | 35044 | 4.8% |
| N | 34652 | 4.8% |
| 2 | 34564 | 4.8% |
| 1 | 30139 | 4.1% |
| 4 | 26540 | 3.6% |
| 5 | 25512 | 3.5% |
| H | 24901 | 3.4% |
| Other values (36) | 261807 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 334283 | |
| Decimal Number | 247926 | |
| Space Separator | 141658 | |
| Dash Punctuation | 3708 | 0.5% |
| Lowercase Letter | 54 | < 0.1% |
| Other Punctuation | 25 | < 0.1% |
| Close Punctuation | 4 | < 0.1% |
| Modifier Symbol | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 61198 | |
| S | 51646 | |
| N | 34652 | |
| H | 24901 | 7.4% |
| A | 24430 | 7.3% |
| E | 17510 | 5.2% |
| R | 16397 | 4.9% |
| W | 14271 | 4.3% |
| L | 13440 | 4.0% |
| V | 12596 | 3.8% |
| Other values (16) | 63242 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 35044 | |
| 2 | 34564 | |
| 1 | 30139 | |
| 4 | 26540 | |
| 5 | 25512 | |
| 0 | 21327 | |
| 6 | 20486 | |
| 7 | 19876 | |
| 8 | 18513 | |
| 9 | 15925 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 19 | |
| \ | 5 | 20.0% |
| . | 1 | 4.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 18 | |
| i | 18 | |
| t | 18 |
Space Separator
| Value | Count | Frequency (%) |
| 141658 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3708 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 4 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 393324 | |
| Latin | 334337 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 61198 | |
| S | 51646 | |
| N | 34652 | |
| H | 24901 | 7.4% |
| A | 24430 | 7.3% |
| E | 17510 | 5.2% |
| R | 16397 | 4.9% |
| W | 14271 | 4.3% |
| L | 13440 | 4.0% |
| V | 12596 | 3.8% |
| Other values (19) | 63296 |
Common
| Value | Count | Frequency (%) |
| 141658 | ||
| 3 | 35044 | 8.9% |
| 2 | 34564 | 8.8% |
| 1 | 30139 | 7.7% |
| 4 | 26540 | 6.7% |
| 5 | 25512 | 6.5% |
| 0 | 21327 | 5.4% |
| 6 | 20486 | 5.2% |
| 7 | 19876 | 5.1% |
| 8 | 18513 | 4.7% |
| Other values (7) | 19665 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 727661 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 141658 | ||
| T | 61198 | 8.4% |
| S | 51646 | 7.1% |
| 3 | 35044 | 4.8% |
| N | 34652 | 4.8% |
| 2 | 34564 | 4.8% |
| 1 | 30139 | 4.1% |
| 4 | 26540 | 3.6% |
| 5 | 25512 | 3.5% |
| H | 24901 | 3.4% |
| Other values (36) | 261807 |
District
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.411256 |
| Minimum | 1 |
|---|---|
| Maximum | 15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 365.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5 |
| median | 9 |
| Q3 | 12 |
| 95-th percentile | 14 |
| Maximum | 15 |
| Range | 14 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 4.2262715 |
|---|---|
| Coefficient of variation (CV) | 0.50245428 |
| Kurtosis | -1.1947322 |
| Mean | 8.411256 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.21335309 |
| Sum | 392915 |
| Variance | 17.861371 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 6355 | |
| 11 | 5874 | |
| 14 | 4893 | |
| 10 | 4844 | |
| 13 | 4560 | |
| 2 | 3216 | |
| 7 | 2809 | |
| 3 | 2651 | |
| 1 | 2644 | |
| 9 | 2346 | 5.0% |
| Other values (5) | 6521 |
| Value | Count | Frequency (%) |
| 1 | 2644 | |
| 2 | 3216 | |
| 3 | 2651 | |
| 4 | 340 | 0.7% |
| 5 | 6355 | |
| 6 | 1804 | 3.9% |
| 7 | 2809 | |
| 8 | 1697 | 3.6% |
| 9 | 2346 | 5.0% |
| 10 | 4844 |
| Value | Count | Frequency (%) |
| 15 | 1526 | 3.3% |
| 14 | 4893 | |
| 13 | 4560 | |
| 12 | 1154 | 2.5% |
| 11 | 5874 | |
| 10 | 4844 | |
| 9 | 2346 | 5.0% |
| 8 | 1697 | 3.6% |
| 7 | 2809 | |
| 6 | 1804 | 3.9% |
Nbhd
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 184 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2901.9262 |
| Minimum | 40 |
|---|---|
| Maximum | 6470 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 365.1 KiB |
Quantile statistics
| Minimum | 40 |
|---|---|
| 5-th percentile | 560 |
| Q1 | 1740 |
| median | 2840 |
| Q3 | 4340 |
| 95-th percentile | 4780 |
| Maximum | 6470 |
| Range | 6430 |
| Interquartile range (IQR) | 2600 |
Descriptive statistics
| Standard deviation | 1430.9731 |
|---|---|
| Coefficient of variation (CV) | 0.49311148 |
| Kurtosis | -1.2601556 |
| Mean | 2901.9262 |
| Median Absolute Deviation (MAD) | 1400 |
| Skewness | -0.091377298 |
| Sum | 1.3555768 × 108 |
| Variance | 2047684.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2100 | 1539 | 3.3% |
| 2080 | 1349 | 2.9% |
| 4520 | 1162 | 2.5% |
| 4340 | 1144 | 2.4% |
| 4240 | 1087 | 2.3% |
| 4420 | 1069 | 2.3% |
| 4620 | 907 | 1.9% |
| 4580 | 788 | 1.7% |
| 4700 | 760 | 1.6% |
| 2040 | 752 | 1.6% |
| Other values (174) | 36156 |
| Value | Count | Frequency (%) |
| 40 | 146 | 0.3% |
| 50 | 65 | 0.1% |
| 240 | 537 | |
| 360 | 247 | 0.5% |
| 380 | 100 | 0.2% |
| 440 | 338 | |
| 480 | 696 | |
| 520 | 73 | 0.2% |
| 560 | 310 | |
| 600 | 193 | 0.4% |
| Value | Count | Frequency (%) |
| 6470 | 1 | < 0.1% |
| 6465 | 2 | < 0.1% |
| 6460 | 10 | |
| 6423 | 1 | < 0.1% |
| 6290 | 3 | < 0.1% |
| 6288 | 4 | < 0.1% |
| 6286 | 2 | < 0.1% |
| 6284 | 16 | |
| 6283 | 8 | |
| 6282 | 7 |
Style
Categorical
HIGH CORRELATION 
| Distinct | 41 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 365.1 KiB |
| Ranch | |
|---|---|
| Cape Cod | |
| Milwaukee Bungalow | |
| Duplex O/S | |
| Residence O/S | |
| Other values (36) |
Length
| Max length | 50 |
|---|---|
| Median length | 42 |
| Mean length | 9.4530645 |
| Min length | 4 |
Characters and Unicode
| Total characters | 441581 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | AP 1 |
|---|---|
| 2nd row | Rm or Rooming House |
| 3rd row | Rm or Rooming House |
| 4th row | AP 1 |
| 5th row | Mansion |
Common Values
| Value | Count | Frequency (%) |
| Ranch | 13374 | |
| Cape Cod | 8957 | |
| Milwaukee Bungalow | 3720 | 8.0% |
| Duplex O/S | 3447 | 7.4% |
| Residence O/S | 2926 | 6.3% |
| Colonial | 2884 | 6.2% |
| Dplx Bungalow | 2692 | 5.8% |
| Duplex N/S | 2405 | 5.1% |
| Res O/S A & 1/2 | 1705 | 3.6% |
| Cottage | 1026 | 2.2% |
| Other values (31) | 3577 | 7.7% |
Length
| Value | Count | Frequency (%) |
| ranch | 13375 | |
| cape | 8957 | |
| cod | 8957 | |
| o/s | 8770 | |
| bungalow | 6412 | 7.8% |
| duplex | 5852 | 7.1% |
| milwaukee | 3720 | 4.5% |
| residence | 3247 | 4.0% |
| colonial | 2884 | 3.5% |
| dplx | 2692 | 3.3% |
| Other values (61) | 17333 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 39000 | 8.8% |
| a | 37181 | 8.4% |
| 35486 | 8.0% | |
| l | 27203 | 6.2% |
| n | 27076 | 6.1% |
| o | 25406 | 5.8% |
| C | 22251 | 5.0% |
| R | 19238 | 4.4% |
| p | 19036 | 4.3% |
| u | 17846 | 4.0% |
| Other values (40) | 171858 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 296617 | |
| Uppercase Letter | 88892 | 20.1% |
| Space Separator | 35486 | 8.0% |
| Other Punctuation | 15027 | 3.4% |
| Decimal Number | 4130 | 0.9% |
| Dash Punctuation | 672 | 0.2% |
| Math Symbol | 593 | 0.1% |
| Open Punctuation | 154 | < 0.1% |
| Close Punctuation | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 39000 | |
| a | 37181 | |
| l | 27203 | |
| n | 27076 | |
| o | 25406 | |
| p | 19036 | 6.4% |
| u | 17846 | 6.0% |
| c | 16798 | 5.7% |
| h | 13897 | 4.7% |
| d | 13222 | 4.5% |
| Other values (12) | 59952 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 22251 | |
| R | 19238 | |
| S | 12116 | |
| D | 8947 | |
| O | 8933 | |
| B | 6916 | 7.8% |
| M | 4203 | 4.7% |
| N | 2405 | 2.7% |
| A | 1886 | 2.1% |
| T | 1428 | 1.6% |
| Other values (6) | 569 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2298 | |
| 1 | 1830 | |
| 4 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 12880 | |
| & | 2002 | 13.3% |
| , | 145 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 35486 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 672 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 593 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 154 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 385509 | |
| Common | 56072 | 12.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 39000 | 10.1% |
| a | 37181 | 9.6% |
| l | 27203 | 7.1% |
| n | 27076 | 7.0% |
| o | 25406 | 6.6% |
| C | 22251 | 5.8% |
| R | 19238 | 5.0% |
| p | 19036 | 4.9% |
| u | 17846 | 4.6% |
| c | 16798 | 4.4% |
| Other values (28) | 134474 |
Common
| Value | Count | Frequency (%) |
| 35486 | ||
| / | 12880 | 23.0% |
| 2 | 2298 | 4.1% |
| & | 2002 | 3.6% |
| 1 | 1830 | 3.3% |
| - | 672 | 1.2% |
| + | 593 | 1.1% |
| ( | 154 | 0.3% |
| , | 145 | 0.3% |
| ) | 10 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 441581 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 39000 | 8.8% |
| a | 37181 | 8.4% |
| 35486 | 8.0% | |
| l | 27203 | 6.2% |
| n | 27076 | 6.1% |
| o | 25406 | 5.8% |
| C | 22251 | 5.0% |
| R | 19238 | 4.4% |
| p | 19036 | 4.3% |
| u | 17846 | 4.0% |
| Other values (40) | 171858 |
Extwall
Categorical
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 365.1 KiB |
| Aluminum / Vinyl | |
|---|---|
| Aluminum/Vinyl | |
| Brick | |
| Frame | |
| Stone | |
| Other values (15) |
Length
| Max length | 23 |
|---|---|
| Median length | 17 |
| Mean length | 11.380237 |
| Min length | 4 |
Characters and Unicode
| Total characters | 531605 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Frame |
|---|---|
| 2nd row | Frame |
| 3rd row | Frame |
| 4th row | Frame |
| 5th row | Stone |
Common Values
| Value | Count | Frequency (%) |
| Aluminum / Vinyl | 13931 | |
| Aluminum/Vinyl | 13143 | |
| Brick | 10147 | |
| Frame | 2590 | 5.5% |
| Stone | 1615 | 3.5% |
| Asphalt/Other | 1210 | 2.6% |
| Wood | 1171 | 2.5% |
| Stucco | 807 | 1.7% |
| Masonry / Frame | 760 | 1.6% |
| Masonry/Frame | 593 | 1.3% |
| Other values (10) | 746 | 1.6% |
Length
| Value | Count | Frequency (%) |
| 14691 | ||
| aluminum | 13931 | |
| vinyl | 13931 | |
| aluminum/vinyl | 13143 | |
| brick | 10147 | |
| frame | 3361 | 4.4% |
| stone | 1615 | 2.1% |
| wood | 1288 | 1.7% |
| asphalt/other | 1210 | 1.6% |
| stucco | 807 | 1.1% |
| Other values (14) | 2304 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 64820 | |
| m | 58551 | |
| n | 57664 | |
| l | 55854 | |
| u | 54996 | |
| / | 29820 | 5.6% |
| 29715 | 5.6% | |
| y | 28523 | 5.4% |
| A | 28325 | 5.3% |
| V | 27115 | 5.1% |
| Other values (23) | 96222 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 394906 | |
| Uppercase Letter | 77015 | 14.5% |
| Other Punctuation | 29820 | 5.6% |
| Space Separator | 29715 | 5.6% |
| Dash Punctuation | 149 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 64820 | |
| m | 58551 | |
| n | 57664 | |
| l | 55854 | |
| u | 54996 | |
| y | 28523 | |
| r | 17256 | 4.4% |
| c | 12045 | 3.1% |
| k | 10556 | 2.7% |
| e | 7816 | 2.0% |
| Other values (9) | 26825 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 28325 | |
| V | 27115 | |
| B | 10414 | 13.5% |
| F | 4245 | 5.5% |
| S | 2468 | 3.2% |
| M | 1372 | 1.8% |
| W | 1288 | 1.7% |
| O | 1221 | 1.6% |
| C | 305 | 0.4% |
| H | 142 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 29820 |
Space Separator
| Value | Count | Frequency (%) |
| 29715 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 149 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 471921 | |
| Common | 59684 | 11.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 64820 | |
| m | 58551 | |
| n | 57664 | |
| l | 55854 | |
| u | 54996 | |
| y | 28523 | |
| A | 28325 | |
| V | 27115 | |
| r | 17256 | 3.7% |
| c | 12045 | 2.6% |
| Other values (20) | 66772 |
Common
| Value | Count | Frequency (%) |
| / | 29820 | |
| 29715 | ||
| - | 149 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 531605 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 64820 | |
| m | 58551 | |
| n | 57664 | |
| l | 55854 | |
| u | 54996 | |
| / | 29820 | 5.6% |
| 29715 | 5.6% | |
| y | 28523 | 5.4% |
| A | 28325 | 5.3% |
| V | 27115 | 5.1% |
| Other values (23) | 96222 |
Stories
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.321463 |
| Minimum | 1 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 365.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1.5 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.4275148 |
|---|---|
| Coefficient of variation (CV) | 0.32351629 |
| Kurtosis | 0.28388076 |
| Mean | 1.321463 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.87098293 |
| Sum | 61729.5 |
| Variance | 0.18276891 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 28168 | |
| 2 | 11315 | |
| 1.5 | 7169 | 15.3% |
| 3 | 32 | 0.1% |
| 2.5 | 25 | 0.1% |
| 4 | 2 | < 0.1% |
| 3.5 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 28168 | |
| 1.5 | 7169 | 15.3% |
| 2 | 11315 | |
| 2.5 | 25 | 0.1% |
| 3 | 32 | 0.1% |
| 3.5 | 1 | < 0.1% |
| 4 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 4 | 2 | < 0.1% |
| 3.5 | 1 | < 0.1% |
| 3 | 32 | 0.1% |
| 2.5 | 25 | 0.1% |
| 2 | 11315 | |
| 1.5 | 7169 | 15.3% |
| 1 | 28168 |
Year_Built
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 176 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1939.5678 |
| Minimum | 1835 |
|---|---|
| Maximum | 2023 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 365.1 KiB |
Quantile statistics
| Minimum | 1835 |
|---|---|
| 5-th percentile | 1895 |
| Q1 | 1923 |
| median | 1948 |
| Q3 | 1956 |
| 95-th percentile | 1972 |
| Maximum | 2023 |
| Range | 188 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 24.891764 |
|---|---|
| Coefficient of variation (CV) | 0.012833665 |
| Kurtosis | 0.084214438 |
| Mean | 1939.5678 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | -0.24112662 |
| Sum | 90603031 |
| Variance | 619.59992 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1955 | 1906 | 4.1% |
| 1953 | 1792 | 3.8% |
| 1952 | 1688 | 3.6% |
| 1950 | 1635 | 3.5% |
| 1956 | 1615 | 3.5% |
| 1954 | 1491 | 3.2% |
| 1951 | 1362 | 2.9% |
| 1957 | 1282 | 2.7% |
| 1949 | 1151 | 2.5% |
| 1958 | 1109 | 2.4% |
| Other values (166) | 31682 |
| Value | Count | Frequency (%) |
| 1835 | 1 | < 0.1% |
| 1836 | 2 | |
| 1840 | 1 | < 0.1% |
| 1843 | 1 | < 0.1% |
| 1844 | 1 | < 0.1% |
| 1848 | 1 | < 0.1% |
| 1850 | 3 | |
| 1853 | 1 | < 0.1% |
| 1854 | 1 | < 0.1% |
| 1855 | 2 |
| Value | Count | Frequency (%) |
| 2023 | 1 | < 0.1% |
| 2022 | 5 | < 0.1% |
| 2021 | 2 | < 0.1% |
| 2020 | 4 | < 0.1% |
| 2019 | 4 | < 0.1% |
| 2018 | 10 | |
| 2017 | 17 | |
| 2016 | 12 | |
| 2015 | 6 | < 0.1% |
| 2014 | 9 |
Nr_of_rms
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 40 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4222165 |
| Minimum | 0 |
|---|---|
| Maximum | 63 |
| Zeros | 24929 |
| Zeros (%) | 53.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 365.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 6 |
| 95-th percentile | 11 |
| Maximum | 63 |
| Range | 63 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.2513098 |
|---|---|
| Coefficient of variation (CV) | 1.2422679 |
| Kurtosis | 4.0848375 |
| Mean | 3.4222165 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.3359308 |
| Sum | 159862 |
| Variance | 18.073635 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 24929 | |
| 5 | 5668 | 12.1% |
| 6 | 4383 | 9.4% |
| 10 | 2551 | 5.5% |
| 7 | 2209 | 4.7% |
| 8 | 1774 | 3.8% |
| 4 | 1594 | 3.4% |
| 9 | 1107 | 2.4% |
| 12 | 973 | 2.1% |
| 11 | 552 | 1.2% |
| Other values (30) | 973 | 2.1% |
| Value | Count | Frequency (%) |
| 0 | 24929 | |
| 2 | 1 | < 0.1% |
| 3 | 39 | 0.1% |
| 4 | 1594 | 3.4% |
| 5 | 5668 | 12.1% |
| 6 | 4383 | 9.4% |
| 7 | 2209 | 4.7% |
| 8 | 1774 | 3.8% |
| 9 | 1107 | 2.4% |
| 10 | 2551 | 5.5% |
| Value | Count | Frequency (%) |
| 63 | 1 | < 0.1% |
| 62 | 1 | < 0.1% |
| 45 | 2 | |
| 44 | 1 | < 0.1% |
| 40 | 2 | |
| 39 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 36 | 3 | |
| 33 | 1 | < 0.1% |
| 32 | 3 |
Fin_sqft
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 3180 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1547.8492 |
| Minimum | 256 |
|---|---|
| Maximum | 81865 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 365.1 KiB |
Quantile statistics
| Minimum | 256 |
|---|---|
| 5-th percentile | 852 |
| Q1 | 1092 |
| median | 1355 |
| Q3 | 1847 |
| 95-th percentile | 2729 |
| Maximum | 81865 |
| Range | 81609 |
| Interquartile range (IQR) | 755 |
Descriptive statistics
| Standard deviation | 847.89721 |
|---|---|
| Coefficient of variation (CV) | 0.54779059 |
| Kurtosis | 2162.5248 |
| Mean | 1547.8492 |
| Median Absolute Deviation (MAD) | 324 |
| Skewness | 27.237468 |
| Sum | 72304678 |
| Variance | 718929.69 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 864 | 256 | 0.5% |
| 936 | 227 | 0.5% |
| 672 | 170 | 0.4% |
| 1120 | 162 | 0.3% |
| 1008 | 156 | 0.3% |
| 1176 | 140 | 0.3% |
| 1092 | 131 | 0.3% |
| 1200 | 129 | 0.3% |
| 1150 | 123 | 0.3% |
| 1064 | 116 | 0.2% |
| Other values (3170) | 45103 |
| Value | Count | Frequency (%) |
| 256 | 1 | < 0.1% |
| 416 | 1 | < 0.1% |
| 452 | 1 | < 0.1% |
| 484 | 1 | < 0.1% |
| 487 | 1 | < 0.1% |
| 500 | 2 | |
| 504 | 4 | |
| 512 | 1 | < 0.1% |
| 518 | 1 | < 0.1% |
| 520 | 2 |
| Value | Count | Frequency (%) |
| 81865 | 1 | |
| 57137 | 1 | |
| 26930 | 1 | |
| 21000 | 1 | |
| 19477 | 2 | |
| 16080 | 1 | |
| 15904 | 1 | |
| 13596 | 1 | |
| 12775 | 1 | |
| 12411 | 1 |
Units
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 22 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.2694753 |
| Minimum | 0 |
|---|---|
| Maximum | 431 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 365.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 431 |
| Range | 431 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.2604908 |
|---|---|
| Coefficient of variation (CV) | 1.7806497 |
| Kurtosis | 29026.479 |
| Mean | 1.2694753 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 160.01387 |
| Sum | 59301 |
| Variance | 5.1098186 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 36222 | |
| 2 | 9635 | 20.6% |
| 3 | 636 | 1.4% |
| 4 | 117 | 0.3% |
| 5 | 28 | 0.1% |
| 6 | 27 | 0.1% |
| 7 | 12 | < 0.1% |
| 8 | 8 | < 0.1% |
| 10 | 5 | < 0.1% |
| 9 | 5 | < 0.1% |
| Other values (12) | 18 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 36222 | |
| 2 | 9635 | 20.6% |
| 3 | 636 | 1.4% |
| 4 | 117 | 0.3% |
| 5 | 28 | 0.1% |
| 6 | 27 | 0.1% |
| 7 | 12 | < 0.1% |
| 8 | 8 | < 0.1% |
| 9 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 431 | 1 | < 0.1% |
| 191 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 13 | 3 | |
| 12 | 2 |
Bdrms
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 25 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5903282 |
| Minimum | 0 |
|---|---|
| Maximum | 2031 |
| Zeros | 12 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 365.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 2031 |
| Range | 2031 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 9.4739969 |
|---|---|
| Coefficient of variation (CV) | 2.6387551 |
| Kurtosis | 44898.466 |
| Mean | 3.5903282 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 209.81198 |
| Sum | 167715 |
| Variance | 89.756617 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 22344 | |
| 4 | 11133 | |
| 2 | 6002 | 12.8% |
| 6 | 3268 | 7.0% |
| 5 | 2699 | 5.8% |
| 8 | 397 | 0.8% |
| 7 | 336 | 0.7% |
| 1 | 231 | 0.5% |
| 9 | 88 | 0.2% |
| 10 | 80 | 0.2% |
| Other values (15) | 135 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 12 | < 0.1% |
| 1 | 231 | 0.5% |
| 2 | 6002 | 12.8% |
| 3 | 22344 | |
| 4 | 11133 | |
| 5 | 2699 | 5.8% |
| 6 | 3268 | 7.0% |
| 7 | 336 | 0.7% |
| 8 | 397 | 0.8% |
| 9 | 88 | 0.2% |
| Value | Count | Frequency (%) |
| 2031 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 25 | 2 | < 0.1% |
| 21 | 1 | < 0.1% |
| 20 | 4 | |
| 18 | 6 | |
| 16 | 2 | < 0.1% |
| 15 | 6 |
Fbath
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4840623 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 248 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 365.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.6236543 |
|---|---|
| Coefficient of variation (CV) | 0.4202346 |
| Kurtosis | 2.8233064 |
| Mean | 1.4840623 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.1180416 |
| Sum | 69325 |
| Variance | 0.38894469 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 26151 | |
| 2 | 18121 | |
| 3 | 1910 | 4.1% |
| 0 | 248 | 0.5% |
| 4 | 233 | 0.5% |
| 5 | 37 | 0.1% |
| 6 | 10 | < 0.1% |
| 10 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 248 | 0.5% |
| 1 | 26151 | |
| 2 | 18121 | |
| 3 | 1910 | 4.1% |
| 4 | 233 | 0.5% |
| 5 | 37 | 0.1% |
| 6 | 10 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 6 | 10 | < 0.1% |
| 5 | 37 | 0.1% |
| 4 | 233 | 0.5% |
| 3 | 1910 | 4.1% |
| 2 | 18121 | |
| 1 | 26151 | |
| 0 | 248 | 0.5% |
Hbath
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 365.1 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | 1151 |
| 3 | 51 |
| 10 | 1 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000214 |
| Min length | 1 |
Characters and Unicode
| Total characters | 46714 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 31527 | |
| 1 | 13983 | |
| 2 | 1151 | 2.5% |
| 3 | 51 | 0.1% |
| 10 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 31527 | |
| 1 | 13983 | |
| 2 | 1151 | 2.5% |
| 3 | 51 | 0.1% |
| 10 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 31528 | |
| 1 | 13984 | |
| 2 | 1151 | 2.5% |
| 3 | 51 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46714 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 31528 | |
| 1 | 13984 | |
| 2 | 1151 | 2.5% |
| 3 | 51 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 46714 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 31528 | |
| 1 | 13984 | |
| 2 | 1151 | 2.5% |
| 3 | 51 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46714 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 31528 | |
| 1 | 13984 | |
| 2 | 1151 | 2.5% |
| 3 | 51 | 0.1% |
Lotsize
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 4408 |
|---|---|
| Distinct (%) | 9.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6157.6553 |
| Minimum | 0 |
|---|---|
| Maximum | 227819 |
| Zeros | 200 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 365.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3300 |
| Q1 | 4730 |
| median | 5400 |
| Q3 | 7168 |
| 95-th percentile | 10692 |
| Maximum | 227819 |
| Range | 227819 |
| Interquartile range (IQR) | 2438 |
Descriptive statistics
| Standard deviation | 3953.8268 |
|---|---|
| Coefficient of variation (CV) | 0.6420994 |
| Kurtosis | 817.44476 |
| Mean | 6157.6553 |
| Median Absolute Deviation (MAD) | 1132 |
| Skewness | 18.801894 |
| Sum | 2.8764255 × 108 |
| Variance | 15632746 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4800 | 3419 | 7.3% |
| 3600 | 2292 | 4.9% |
| 6000 | 1272 | 2.7% |
| 5400 | 1230 | 2.6% |
| 7200 | 1122 | 2.4% |
| 5000 | 951 | 2.0% |
| 4920 | 725 | 1.6% |
| 4200 | 609 | 1.3% |
| 5040 | 519 | 1.1% |
| 5160 | 458 | 1.0% |
| Other values (4398) | 34116 |
| Value | Count | Frequency (%) |
| 0 | 200 | |
| 1 | 2 | < 0.1% |
| 613 | 1 | < 0.1% |
| 930 | 2 | < 0.1% |
| 1018 | 1 | < 0.1% |
| 1050 | 6 | < 0.1% |
| 1084 | 1 | < 0.1% |
| 1098 | 1 | < 0.1% |
| 1120 | 1 | < 0.1% |
| 1188 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 227819 | 1 | |
| 219978 | 2 | |
| 209524 | 1 | |
| 128502 | 1 | |
| 119790 | 1 | |
| 101059 | 1 | |
| 95832 | 1 | |
| 90000 | 1 | |
| 84071 | 1 | |
| 83200 | 1 |
Sale_date
Date
| Distinct | 212 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 365.1 KiB |
| Minimum | 2002-02-01 00:00:00 |
|---|---|
| Maximum | 2023-12-01 00:00:00 |
Sale_price
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 4475 |
|---|---|
| Distinct (%) | 9.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 610555.46 |
| Minimum | 0 |
|---|---|
| Maximum | 26250000 |
| Zeros | 30 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 365.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 51000 |
| Q1 | 113000 |
| median | 163000 |
| Q3 | 488400 |
| 95-th percentile | 2600000 |
| Maximum | 26250000 |
| Range | 26250000 |
| Interquartile range (IQR) | 375400 |
Descriptive statistics
| Standard deviation | 984902.32 |
|---|---|
| Coefficient of variation (CV) | 1.6131251 |
| Kurtosis | 38.536111 |
| Mean | 610555.46 |
| Median Absolute Deviation (MAD) | 70300 |
| Skewness | 3.7500625 |
| Sum | 2.8520877 × 1010 |
| Variance | 9.7003258 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 150000 | 448 | 1.0% |
| 125000 | 395 | 0.8% |
| 140000 | 395 | 0.8% |
| 135000 | 390 | 0.8% |
| 130000 | 389 | 0.8% |
| 110000 | 382 | 0.8% |
| 120000 | 377 | 0.8% |
| 160000 | 369 | 0.8% |
| 115000 | 362 | 0.8% |
| 165000 | 356 | 0.8% |
| Other values (4465) | 42850 |
| Value | Count | Frequency (%) |
| 0 | 30 | |
| 100 | 1 | < 0.1% |
| 1000 | 1 | < 0.1% |
| 1100 | 1 | < 0.1% |
| 1250 | 1 | < 0.1% |
| 2000 | 1 | < 0.1% |
| 3500 | 1 | < 0.1% |
| 3900 | 1 | < 0.1% |
| 4000 | 1 | < 0.1% |
| 5000 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 26250000 | 1 | |
| 25000000 | 1 | |
| 20800000 | 1 | |
| 19190000 | 1 | |
| 16090000 | 1 | |
| 15500000 | 1 | |
| 14370000 | 1 | |
| 13400000 | 1 | |
| 13000000 | 1 | |
| 12920500 | 1 |
| Bdrms | District | Extwall | Fbath | Fin_sqft | Hbath | Lotsize | Nbhd | Nr_of_rms | PropType | Sale_price | Stories | Style | Units | Year_Built | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Bdrms | 1.000 | -0.022 | 0.000 | 0.509 | 0.667 | 0.000 | -0.084 | -0.022 | 0.294 | 0.000 | 0.175 | 0.500 | 0.010 | 0.534 | -0.170 |
| District | -0.022 | 1.000 | 0.109 | 0.009 | 0.000 | 0.111 | -0.078 | 0.728 | -0.042 | 0.024 | 0.074 | 0.017 | 0.260 | 0.021 | -0.100 |
| Extwall | 0.000 | 0.109 | 1.000 | 0.064 | 0.306 | 0.091 | 0.047 | 0.214 | 0.181 | 0.331 | 0.086 | 0.287 | 0.307 | 0.449 | 0.199 |
| Fbath | 0.509 | 0.009 | 0.064 | 1.000 | 0.611 | 0.151 | -0.086 | 0.029 | 0.199 | 0.032 | 0.173 | 0.476 | 0.312 | 0.563 | -0.187 |
| Fin_sqft | 0.667 | 0.000 | 0.306 | 0.611 | 1.000 | 0.012 | -0.087 | 0.029 | 0.219 | 0.180 | 0.241 | 0.711 | 0.731 | 0.610 | -0.265 |
| Hbath | 0.000 | 0.111 | 0.091 | 0.151 | 0.012 | 1.000 | 0.021 | 0.100 | 0.062 | 0.027 | 0.057 | 0.098 | 0.351 | 0.000 | 0.172 |
| Lotsize | -0.084 | -0.078 | 0.047 | -0.086 | -0.087 | 0.021 | 1.000 | -0.171 | -0.083 | 0.016 | 0.106 | -0.173 | 0.068 | -0.221 | 0.605 |
| Nbhd | -0.022 | 0.728 | 0.214 | 0.029 | 0.029 | 0.100 | -0.171 | 1.000 | -0.085 | 0.816 | 0.144 | 0.065 | 0.513 | 0.051 | -0.229 |
| Nr_of_rms | 0.294 | -0.042 | 0.181 | 0.199 | 0.219 | 0.062 | -0.083 | -0.085 | 1.000 | 0.018 | 0.559 | 0.198 | 0.211 | 0.240 | -0.121 |
| PropType | 0.000 | 0.024 | 0.331 | 0.032 | 0.180 | 0.027 | 0.016 | 0.816 | 0.018 | 1.000 | 0.083 | 0.101 | 0.814 | 0.068 | 0.042 |
| Sale_price | 0.175 | 0.074 | 0.086 | 0.173 | 0.241 | 0.057 | 0.106 | 0.144 | 0.559 | 0.083 | 1.000 | 0.176 | 0.163 | 0.038 | 0.043 |
| Stories | 0.500 | 0.017 | 0.287 | 0.476 | 0.711 | 0.098 | -0.173 | 0.065 | 0.198 | 0.101 | 0.176 | 1.000 | 0.583 | 0.607 | -0.282 |
| Style | 0.010 | 0.260 | 0.307 | 0.312 | 0.731 | 0.351 | 0.068 | 0.513 | 0.211 | 0.814 | 0.163 | 0.583 | 1.000 | 1.000 | 0.431 |
| Units | 0.534 | 0.021 | 0.449 | 0.563 | 0.610 | 0.000 | -0.221 | 0.051 | 0.240 | 0.068 | 0.038 | 0.607 | 1.000 | 1.000 | -0.270 |
| Year_Built | -0.170 | -0.100 | 0.199 | -0.187 | -0.265 | 0.172 | 0.605 | -0.229 | -0.121 | 0.042 | 0.043 | -0.282 | 0.431 | -0.270 | 1.000 |
| PropType | Address | District | Nbhd | Style | Extwall | Stories | Year_Built | Nr_of_rms | Fin_sqft | Units | Bdrms | Fbath | Hbath | Lotsize | Sale_date | Sale_price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Residential | 3033 N 35TH ST | 7 | 2960 | AP 1 | Frame | 2.0 | 1913 | 0 | 3476 | 4 | 9 | 1 | 0 | 5040 | 2002-02 | 42000 |
| 1 | Residential | 1908 E WEBSTER PL | 3 | 3170 | Rm or Rooming House | Frame | 2.0 | 1897 | 0 | 1992 | 4 | 2 | 2 | 0 | 2880 | 2002-05 | 145000 |
| 2 | Residential | 812 N 25TH ST | 4 | 3040 | Rm or Rooming House | Frame | 2.0 | 1907 | 0 | 2339 | 6 | 0 | 1 | 0 | 3185 | 2002-06 | 30000 |
| 3 | Residential | 959 N 34TH ST | 4 | 2300 | AP 1 | Frame | 2.0 | 1890 | 0 | 2329 | 4 | 4 | 1 | 0 | 5781 | 2002-10 | 66500 |
| 4 | Residential | 3209 W WELLS ST | 4 | 2300 | Mansion | Stone | 2.5 | 1891 | 0 | 7450 | 2 | 7 | 6 | 0 | 15600 | 2002-11 | 150500 |
| 5 | Residential | 2143 S 11TH ST | 12 | 4120 | Duplex O/S | Frame | 1.5 | 1906 | 0 | 2462 | 2 | 3 | 2 | 0 | 5075 | 2002-11 | 75000 |
| 6 | Residential | 1116 N 13TH ST | 4 | 3040 | Rm or Rooming House | Frame | 1.5 | 1890 | 0 | 2372 | 6 | 2 | 2 | 0 | 7750 | 2002-12 | 35000 |
| 7 | Residential | 3350 W RUSKIN ST | 11 | 4400 | Cape Cod | Brick | 1.0 | 1950 | 0 | 1149 | 1 | 3 | 1 | 0 | 4800 | 2003-06 | 75000 |
| 8 | Residential | 4826 N 51ST BL | 1 | 1150 | Cape Cod | Aluminum / Vinyl | 1.0 | 1947 | 0 | 994 | 1 | 3 | 1 | 0 | 4200 | 2003-08 | 22000 |
| 9 | Residential | 3706A W SHERIDAN AV | 1 | 1160 | AP 1 | Stucco | 2.0 | 1905 | 0 | 2938 | 4 | 3 | 1 | 0 | 9480 | 2003-12 | 125000 |
| PropType | Address | District | Nbhd | Style | Extwall | Stories | Year_Built | Nr_of_rms | Fin_sqft | Units | Bdrms | Fbath | Hbath | Lotsize | Sale_date | Sale_price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 46703 | Residential | 5934 S 18TH ST | 13 | 4860 | Ranch | Aluminum/Vinyl | 1.0 | 1966 | 6 | 1421 | 1 | 3 | 1 | 1 | 13090 | 2019-07 | 2387500 |
| 46704 | Residential | 2135 W HENRY AV | 13 | 4860 | Ranch | Brick | 1.0 | 1962 | 6 | 1507 | 1 | 3 | 1 | 1 | 12996 | 2019-10 | 1600000 |
| 46705 | Residential | 6145 S 23RD ST | 13 | 4860 | Cape Cod | Aluminum/Vinyl | 1.0 | 2004 | 6 | 1997 | 1 | 3 | 2 | 1 | 7200 | 2019-03 | 2800000 |
| 46706 | Residential | 6235 S 26TH ST | 13 | 4860 | Ranch | Aluminum/Vinyl | 1.0 | 1963 | 5 | 1138 | 1 | 3 | 1 | 1 | 5330 | 2019-06 | 1650000 |
| 46707 | Residential | 2235 W BRIDGE ST | 13 | 4860 | Ranch | Aluminum/Vinyl | 1.0 | 1967 | 6 | 1516 | 1 | 3 | 1 | 1 | 8990 | 2019-05 | 2335000 |
| 46708 | Residential | 2418 W KIMBERLY AV | 13 | 4860 | Milwaukee Bungalow | Aluminum/Vinyl | 1.0 | 1928 | 6 | 1375 | 1 | 3 | 1 | 1 | 8398 | 2019-11 | 1849000 |
| 46709 | Residential | 6687 S 19TH ST | 13 | 4920 | Ranch | Aluminum/Vinyl | 1.0 | 1960 | 5 | 981 | 1 | 3 | 1 | 1 | 6000 | 2019-06 | 1950000 |
| 46710 | Residential | 6815 S 19TH ST | 13 | 4920 | Ranch | Aluminum/Vinyl | 1.0 | 1961 | 5 | 1110 | 1 | 3 | 1 | 1 | 6240 | 2019-07 | 1780000 |
| 46711 | Residential | 1800 W ASPEN ST | 13 | 4920 | Ranch | Aluminum/Vinyl | 1.0 | 1961 | 6 | 1108 | 1 | 3 | 1 | 1 | 7800 | 2019-07 | 1860000 |
| 46712 | Residential | 1747 W ASPEN ST | 13 | 4920 | Ranch | Aluminum/Vinyl | 1.0 | 1964 | 5 | 891 | 1 | 3 | 1 | 1 | 6500 | 2019-06 | 1535000 |
Most frequently occurring
| PropType | Address | District | Nbhd | Style | Extwall | Stories | Year_Built | Nr_of_rms | Fin_sqft | Units | Bdrms | Fbath | Hbath | Lotsize | Sale_date | Sale_price | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 26 | Residential | 5853 N 74TH ST | 2 | 980 | Ranch | Aluminum / Vinyl | 1.0 | 1952 | 0 | 969 | 1 | 2 | 1 | 0 | 4800 | 2009-05 | 5000 | 3 |
| 0 | Residential | 10939 W CAMERON AV | 5 | 1040 | Ranch | Brick | 1.0 | 1960 | 6 | 1015 | 1 | 3 | 1 | 1 | 7345 | 2021-09 | 150000 | 2 |
| 1 | Residential | 1308 S LAYTON BL | 8 | 4000 | Colonial | Frame | 2.0 | 1922 | 0 | 1824 | 1 | 4 | 1 | 1 | 5590 | 2009-12 | 101000 | 2 |
| 2 | Residential | 1756 N HI MOUNT BL | 10 | 2580 | Residence O/S | Masonry / Frame | 2.0 | 1915 | 0 | 2728 | 1 | 5 | 2 | 1 | 8100 | 2017-04 | 335000 | 2 |
| 3 | Residential | 1937 S 5TH ST | 12 | 4120 | Duplex O/S | Aluminum / Vinyl | 1.5 | 1910 | 0 | 1846 | 2 | 6 | 2 | 0 | 2880 | 2016-06 | 63000 | 2 |
| 4 | Residential | 2012 W ORCHARD ST | 8 | 4100 | Residence O/S | Aluminum / Vinyl | 1.0 | 1892 | 0 | 1523 | 1 | 4 | 2 | 0 | 4200 | 2010-05 | 80000 | 2 |
| 5 | Residential | 2021 W PLAINFIELD AV | 13 | 4660 | Ranch | Brick | 1.0 | 1965 | 5 | 1150 | 1 | 3 | 1 | 1 | 5900 | 2019-01 | 1579000 | 2 |
| 6 | Residential | 230 W MARTIN LA | 13 | 4740 | Ranch | Brick | 1.0 | 1958 | 0 | 990 | 1 | 3 | 1 | 0 | 7650 | 2016-09 | 156000 | 2 |
| 7 | Residential | 2554 N 46TH ST | 15 | 2520 | Milwaukee Bungalow | Aluminum / Vinyl | 1.0 | 1919 | 0 | 1517 | 1 | 3 | 2 | 0 | 5000 | 2014-12 | 95000 | 2 |
| 8 | Residential | 2636 S 65TH ST | 11 | 4240 | Cape Cod | Block | 1.0 | 1949 | 0 | 1406 | 1 | 4 | 2 | 0 | 4900 | 2015-07 | 159500 | 2 |